Model-Based Reinforcement Learning With Kernels for Resource Allocation in RAN Slices

نویسندگان

چکیده

Network slicing is a key feature of 5G and beyond networks, allowing the deployment separate logical networks (network slices), sharing common underlying physical infrastructure, characterized by distinct descriptors behaviors. The dynamic allocation network resources among coexisting slices should address challenging trade-off: to use efficiently while assigning each slice sufficient meet its service level agreement (SLA). We consider time-frequency from new perspective: design control algorithm capable learning over operating network, keeping SLA violation rate under an acceptable during process. For this purpose, traditional model-free reinforcement (RL) methods present several drawbacks: low sample efficiency, extensive exploration policy space, inability discriminate between conflicting objectives, causing inefficient and/or frequent violations To overcome these limitations, we propose model-based RL approach built upon novel modeling strategy that comprises kernel-based classifier self-assessment mechanism. In numerical experiments, our proposal, referred as RL, clearly outperforms state-of-the-art algorithms in terms fulfillment, resource computational overhead.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online Resource Allocation Using Decompositional Reinforcement Learning

This paper considers a novel application domain for reinforcement learning: that of “autonomic computing,” i.e. selfmanaging computing systems. RL is applied to an online resource allocation task in a distributed multi-application computing environment with independent time-varying load in each application. The task is to allocate servers in real time so as to maximize the sum of performance-ba...

متن کامل

Market Model for Resource Allocation in Emerging Sensor Networks with Reinforcement Learning

Emerging sensor networks (ESNs) are an inevitable trend with the development of the Internet of Things (IoT), and intend to connect almost every intelligent device. Therefore, it is critical to study resource allocation in such an environment, due to the concern of efficiency, especially when resources are limited. By viewing ESNs as multi-agent environments, we model them with an agent-based m...

متن کامل

Reinforcement Learning for Resource Allocation in Multiuser OFDM Systems

Abstract In cellular mobile communications the subcarriers are repeatedly used for best utilizing the assigned frequency spectrum. The assignment of channels for users is complex and involves high computation time. Wong et al [15] proposed a heuristic algorithm to achieve the suboptimal solution for sub-carrier assignment. This proposal was based on constructive assignment in real time situatio...

متن کامل

Deep Reinforcement Learning for Resource Allocation in V2V Communications

In this article, we develop a decentralized resource allocation mechanism for vehicle-to-vehicle (V2V) communication systems based on deep reinforcement learning. Each V2V link is considered as an agent, making its own decisions to find optimal sub-band and power level for transmission. Since the proposed method is decentralized, the global information is not required for each agent to make its...

متن کامل

Reinforcement learning with kernels and Gaussian processes

Kernel methods have become popular in many sub-fields of machine learning with the exception of reinforcement learning; they facilitate rich representations, and enable machine learning techniques to work in diverse input spaces. We describe a principled approach to the policy evaluation problem of reinforcement learning. We present a temporal difference (TD) learning using kernel functions. Ou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Wireless Communications

سال: 2023

ISSN: ['1536-1276', '1558-2248']

DOI: https://doi.org/10.1109/twc.2022.3195570